# Edge device optimization

phi-2 is a text generation model employing IQ-DynamicGate ultra-low bit quantization (1-2 bits), suitable for natural language processing and code generation tasks.

Large Language Model Supports Multiple Languages

Orpheus 3b 0.1 Ft GGUF

An ultra-low-bit quantized model based on the Llama-3-8B architecture, using IQ-DynamicGate for adaptive 1-2 bit quantization; suited to memory-constrained environments.

Large Language Model English

OlympicCoder 32B GGUF

OlympicCoder-32B is a code generation model based on Qwen2.5-Coder-32B-Instruct, employing IQ-DynamicGate ultra-low-bit quantization technology for efficient inference in memory-constrained environments.

Large Language Model English

EXAONE Deep 32B GGUF

EXAONE-Deep-32B is a 32B-parameter large language model supporting English and Korean, specifically designed for text generation tasks.

Large Language Model Supports Multiple Languages

Llama 3.1 Nemotron Nano 8B V1 GGUF

An 8B-parameter model based on the Llama-3 architecture, optimized for low memory usage with IQ-DynamicGate ultra-low-bit quantization.

Large Language Model English

EXAONE Deep 7.8B GGUF

A 7.8B-parameter model featuring ultra-low-bit quantization (1-2 bits) using IQ-DynamicGate technology, supporting English and Korean text generation tasks.

Large Language Model Supports Multiple Languages

Mistral Small 3.1 24B Instruct 2503 GGUF

An instruction-tuned model based on Mistral-Small-3.1-24B-Base-2503, distributed in GGUF format with IQ-DynamicGate ultra-low-bit quantization.

Large Language Model Supports Multiple Languages

Qwen2.5 7B Instruct 1M GGUF

Qwen2.5-7B-Instruct-1M is an instruction-tuned version of Qwen2.5-7B, using IQ-DynamicGate ultra-low-bit quantization (1-2 bits) for efficient inference in memory-constrained environments.

Large Language Model English

Llama 3.1 8B Instruct GGUF

Llama-3.1-8B-Instruct is an instruction-tuned version of Llama-3.1-8B, using IQ-DynamicGate ultra-low-bit quantization (1-2 bits) to improve accuracy while maintaining memory efficiency.

Large Language Model Supports Multiple Languages

Mistral 7B Instruct V0.2 GGUF

Mistral-7B-Instruct-v0.2 is an instruction-tuned text generation model based on the Mistral-7B architecture, optimized for memory efficiency with IQ-DynamicGate ultra-low-bit quantization.

Large Language Model

Reasonablellama3 3B Jr

A reasoning model fine-tuned from LLaMA-3B, with enhanced reasoning capabilities and multilingual support.

Large Language Model Supports Multiple Languages

Tiny Agent A 3B

Tiny-Agent-α is a lightweight AI agent trained on the Qwen2.5-Coder model series and designed for edge devices, with support for Pythonic function calling.

Large Language Model Supports Multiple Languages

Comment Moderation

A multi-label content moderation model built on the DistilBERT architecture that detects and classifies potentially harmful content in user comments; designed to be both accurate and lightweight.

Text Classification

Transformers English

Tiny Hinglish Chat 21M

A tiny text-completion model for mixed Hindi-English (Hinglish) dialogue, capable of conversing on everyday topics.

Dialogue System

Transformers English

© 2025 AIbase